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Principal component analysis is considered as an addition to the well-tested parametrization 
w(a) = wo + w a (l — a) for the dark energy equation of state. This brief note cautions against some 
unjustified assumptions in interpretation of PCA calculations, giving quantified examples. 



I. INTRODUCTION 



II. UNCERTAINTIES ARE SMALL, SO WE 
KNOW THE ANSWER? 
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Dark energy is a premier mystery of cosmology and 
high energy physics. To address this, NASA and the De- 
partment of Energy (DOE) are funding the Joint Dark 
Energy Mission (JDEM). Because the physical nature of 
dark energy is so unknown, it is challenging to quantify 
simply and accurately the science requirements for learn- 
ing about the physical origin. To assess approaches to a 
figure of merit for the dark energy science reach of dif- 
ferent JDEM architectures, NASA and DOE formed the 
Figure of Merit Science Working Group (FOMSWG). 



FOMSWG found in a preliminary report [l| that the 
figure of merit used by the Dark Energy Task Force Q, 
given by the inverse area of the likelihood contour in the 
dark energy equation of state plane WQ-w a was reasonable 
for the task. Here the equation of state (EOS) as a func- 
tion of scale factor is given by w(a) = WQ+w a (l — a). This 
form has been tested for physical accuracy and against 
bias, and has been shown to faithfully reproduce relations 
for observables such as distances and Hubble parameters 
to an accuracy of 10 -3 [H, better than needed for JDEM. 



These results, and especially the calibration relations 
of [|[, should substantially allay concerns about using a 
particular functional form. Recall that the current issue 
is how to project simulated constraints of various JDEM 
scenarios; once the data are in hand one will carry out 
the analysis through a diversity of methods, and test spe- 
cific models directly against the data without an inter- 
mediate form. Nevertheless, one can reasonably consider 
an additional (not replacement) approach to build confi- 
dence that the conclusions on science design were robust. 
An example is principal component analysis (see, e.g., 
[1, H, 0,0] for application to dark energy), which has the 
capability of covering a wide variety of functional forms. 



The assessment of whether principal components 
(PCs) add new insights to JDEM design projections de- 
pends on appropriate scientific criteria. Here we present 
brief cautions about possible oversimplifications of in- 
terpretation, titling the following section headings with 
some of the possible misunderstandings. 



The principal components compose the EOS through 
w(a) -w h (a) ^^ ai ei(a), (1) 

i 

where Wb is the baseline EOS to compare to (e.g. w — 
— 1), ei are the eigenmodes of the Fisher matrix for the 
particular experiment, and an are the mode coefficients. 

Given the information from some (simulated) experi- 
ment, one forms the Fisher information matrix and di- 
agonalizes it. The rows of the diagonalization matrix are 
the eigenvectors, or modes ei(a). If the Fisher matrix 
was formed with respect to the baseline model, then the 
expectation value of the coefficients (c^) = 0. The rms 
about the mean is &i = cr(cvi), i.e. the inverse square root 
of the eigenvalues. 

One of the key misapprehensions of PCA is the physi- 
cal interpretation of the uncertainties on the amplitudes 
cti of each mode. Although PCs are often ordered by the 
uncertainties, these values o~i have no physical meaning 
by themselves. One has to interpret their magnitude in 
terms of some distance between models. One possibility 
is adopting the range of w G [—1,0], valid for many mod- 
els (with the upper limit to avoid disrupting early matter 
domination), and considering values of &i as useful if they 
are smaller than one. For example, FOMSWG imposes 
priors on the EOS such that oi cannot exceed one. This 
prescription is equivalent to a redefinition 
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One might choose to form a figure of merit given as 
the reciprocal of the product of all the oVs. This appears 
similar to a multi-dimensional extension (cf. 0]) of the 
DETF figure of merit written in terms of l/[a(w p ) a(w a )] 
where w p is the pivot EOS value. However, none of this 
addresses the meaning. Since every PC by construction 
has an uncertainty smaller than or equal to one, does 
this mean that all give physical insight? As argued by 
[UH, Q, the Ci's alone capture no physics. The imposi- 
tion of a prior does define a goal for "smallness" of the 
uncertainty, if somewhat artificially. However, just as 
in most astronomy, the true information is held by the 
signal-to-noise ratio, 
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where each mode contributes a, /cr, worth of S/AT 0, [l(| • 
This is equivalent to the \ 2 comparison of mode coeffi- 
cients between a model and the baseline. So the impor- 
tant quantity is not Oi but ai/cti. 

Consider if one misestimated the value of some PC co- 
efficient on by Oi (i.e. la) - would it change the physics 
conclusion? In particular, the cosmological constant cor- 
responds to on = for all PCs. If some is within la 
(or some other confidence level) of zero, then that PC is 
not effectively contributing to distinction of the physics 
from a cosmological constant, since there is a high sta- 
tistical probability that en is consistent with zero. (This 
can of course be generalized to differences between any 
two models.) 

Therefore, how many PCs truly contribute to distin- 
guishing some model from the cosmological constant A? 
We test this for a continuum of models, over various 
model classes, assuming an imaginary experiment giv- 
ing 0.3% distance measurements between z — — 3, plus 
Planck CMB information. Every one of 36 PCs between 
a = 0.1 — 1 have a^s less than one, by the construction 
of Eq. However, when we carry out the test for vari- 
ous model parameters, and model classes, below, we find 
generically that only ~2-3 PCs have ai/on < 1/2, i.e. a 
2<T deviation from the cosmological constant, even with 
the extraordinary assumptions of experimental accuracy. 
This is in agreement with the multiple studies of this 
issue by [![. 

Figures [T|[2] illustrate the physical discriminating 
power of this idealized experiment as a function of cos- 
mological model. For most models viable under current 
data, 2-3 PCs have sufficient S/N to be useful in distin- 
guishing between the true cosmology and the cosmologi- 
cal constant. Of course as the model approaches the cos- 
mological constant, distinction becomes more difficult. 

To exhibit robustness of these results against the spe- 
cific assumed model, we scan over model parameters for 
classes of qualitatively different dark energy physics, rep- 
resenting the thawing class, freezing class, and a non- 
monotonic EOS. In the last case, we see that the abil- 
ity of PCA not to be locked into a particular functional 
form, e.g. a monotonic parametrization, does not make 
more degrees of freedom significant. 

Again, what we really care about is the S/N. In TableU 
we list the fraction of the total S/N (Eq. [3]) contributed 
by the two best modes - for the case for each dark energy 
class where higher modes contribute the most. For the 
thawing class the higher modes add less than 0.3% to 
the total, and for the freezing class less than 2.8% in the 
most sensitive case, dropping to less than 0.5% for modes 
above the third. And recall this was for a highly idealized 
experiment. In the oscillating model, an ad hoc case 
designed to be especially PCA-friendly, the most extreme 
case with oscillations reaching w = allows higher modes 
to contribute up to 14% (8% for above the third mode). 
These are actually overestimates of the importance of 
high modes because as discussed in the next section the 
S/N of the higher modes degrades when w(z > 9) is 
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FIG. 1: Noise-to-signal ratios Oi/on vs. deviation from the 
cosmological constant for models with w(a) = — l + ly a (l — a) 
in the freezing class (top), and w(a) = ( — 1 — w a ) -\- w a (1 — a) 
in the thawing class (bottom) . Approaching the cosmological 
constant, w a — > 0, the models become difficult to distinguish 
from a = 0, i.e. A. Only 3-5 PCs have S/N > 1, for any 
of the models. The intersection of the horizontal line with a 
PC shows where that mode gives a 2a indication of difference 
from A. For \w a \ < 0.4, only 2 PCs meet this criterion. (Note 
breaks in the PC5 curve come from «5 switching sign.) 



marginalized over rather than fixed. 

Interestingly, fitting Wo-w a rather than using PCA pro- 
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FIG. 2: As Fig. [T] but for to (a) = -1 + A[l - cos(lna)] with 
non-monotonic behavior. Note w would exceed at some 
redshifts for A > 0.5. 



Model 


(S/jV) 2 /(S/iV) all 


(S/N) 3 /(S/N) M 


Freezing (tw a = 0.7) 


0.972 


0.995 


Thawing (w a — —0.5) 


0.997 


0.9998 


Oscillating (A = 0.5) 


0.862 


0.922 



TABLE I: Fraction of total signal-to-noise contributed by the 
first two, or three, principal components for the case in each 
dark energy class most favoring PCA high modes. 



vides distinction from the cosmological constant at the 
la (2a) level for |to„| = 0.13 (0.24) - very comparable 
to the PCA approach. That is, the second parameter 
becomes useful at almost the same values as in the top 
panel of Fig. [1] for the PCA freezing case, and is more 
sensitive than in the bottom panel for the PCA thawing 
case. Both methods demonstrate that significant physi- 
cal constraints on the dark energy EOS are described by 
of order two quantities. 

The main point though is that the important informa- 
tion is not in Cj, but aijon- Just because PCA may say 
uncertainties ai are small, this does not mean that we 
know the physics answer. 



III. WHAT HAPPENS AT HIGH REDSHIFTS, 
STAYS AT HIGH REDSHIFTS? 

While the previous demonstration of PC uncertainties 
and signal-to-noise is the most important of this paper, 



we also note that assumptions about the high redshift 
EOS behavior have significant effects. In practice, the 
PCs are often computed assuming a cut off at some max- 
imum redshift to avoid complications in calculating the 
cosmic microwave background (CMB) primordial power 
spectra and the initial conditions for growth of matter 
perturbations. At higher redshifts one must therefore 
choose a particular form or value for the EOS. FOM- 
SWG fixes w(z > 9) = — 1. One justification is that for 
the cosmological constant the dark energy density fades 
away quickly into the past, so the exact value of w there 
is unimportant. However, we have no guarantee that the 
cosmological constant is the correct model, nor even es- 
sentially any current information on the behavior of the 
EOS at z > 1 Assumptions about the high redshift 
behavior can lead to significant biases and improper con- 
clusions about the nature of dark energy (see, e.g., [6|). 

Bias should not be as severe a problem for a high tran- 
sition redshift, z = 9, and with many redshift bins at 
z > 3 the extra degrees of freedom should ameliorate bias 
from the prior at z > 9. However, fixing w(z > 9) = — 1 
does demonstrably influence the PCs: for example some 
of the uncertainties cr, / ai that are apparently tightly de- 
termined can degrade by a factor three when w(z > 9) is 
not fixed. The modes themselves also change shape, as 
we discuss next. 

Thus, what happens at high redshift does not stay at 
high redshift, but can affect some important aspects of 
the principal component analysis. 



IV. HIGHEST IS BEST? 

Does the location in redshift of the maximum of a PC, 
say the first one, say something fundamental about the 
science reach of the survey or probe employed? No - as 
is clear from Eq. (Q]) the EOS constraints follow from the 
sum - with both positive and negative contributions - 
over all the PCs, not any single one. 

Moreover, Figure[3]demonstrates that artificially fixing 
the high redshift EOS behavior changes the PC shape 
and peak location. This shift has nothing to do with 
the experimental design and so the peak location is not 
a signpost to experiment optimization. Assumptions on 
w(z > 9) can affect probes differently: e.g. supernova 
distances do not involve w(z > 9) while baryon acoustic 
oscillations, being tied to high redshift, do. 

Finally, even if a PC peak did mean that the experi- 
ment is most sensitive to the dark energy EOS at a higher 
redshift, say, that would not imply that the experiment 
is most sensitive to the nature of dark energy. For ex- 
ample, dark energy is most influential today, so perhaps 
one wants an experiment most sensitive to the low red- 
shift behavior. (We emphasize that understanding dark 
energy at low redshift still requires measuring expansion 
and growth to high redshift, to break degeneracies.) At 
best, one could say that probes that weight the dark en- 
ergy differently in redshift have some complementarity. 
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But there is no justification for claiming that the probe 
with the highest peak, or with the peak at the highest 
redshift, is the best probe. 



V. CONCLUSIONS 

Principal component analysis is a valid technique, 
used appropriately. Oversimplifying PCA interpretation 
or inadequately appreciating the effect of assumptions 
employed can lead to misunderstandings and false be- 
liefs. We present cautionary examples of three appar- 
ently plausible but unjustified extrapolations. While 
data should be analyzed in every reasonable manner, 
for understanding the generic cosmology reach the more 
complicated PCA approach demonstrates no extraordi- 
nary advantage over the well-tested and highly calibrated 
phase space dynamics approach of WQ-w a . 



FIG. 3: The location and shape of peaks in principal com- 
ponent modes depend on the high redshift treatment of the 
EOS. The peak location therefore does not in itself translate 
into the science impact of a given experiment. Note the effects 
are more severe for a more realistic experiment. 
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